Foreign accent classification using source generator based prosodic features
Authors
John H.L. Hansen and Levent M. Arslan, Robust Speech Processing Laboratory, Duke University, Department of Electrical Engineering, Box 90291, Durham, North Carolina 27708-0291
Abstract
Speaker accent is an important issue in the formulation of robust speaker-independent recognition systems. Knowledge gained from a reliable accent classification approach could improve overall recognition performance. In this paper, a new algorithm is proposed for foreign accent classification of American English. A series of experimental studies is considered, focusing on establishing how speech production is varied to convey accent. The proposed method uses a source generator framework, recently proposed for analysis and recognition of speech under stress [5]. An accent-sensitive database is established using speakers of American English with foreign language accents. An initial version of the classification algorithm classified speaker accent from among four different accents with an accuracy of 81.5% in the case of unknown text, and 88.9% assuming known text. Finally, it is shown that as the accent-sensitive word count increases, the ability to correctly classify accent also increases, achieving an overall classification rate of 92% among four accent classes.
Similar resources
FOREIGN ACCENT CLASSIFICATION USING SOURCE GENERATOR BASED PROSODIC FEATURES - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Voice morphing and the manipulation of intra-speaker and cross-speaker phonetic variation to create foreign accent continua: a perceptual study
The STRAIGHT system of voice morphing was used to create voice continua of (Korean) accented Australian English, intended to simulate phonetic variation ranging from ‘heavily accented’ to ‘unaccented’ (native-like) Australian English, employing dimensions of intra-speaker and cross-speaker variation to yield a range of synthetic voices. These synthetic voices were evaluated against actual sampl...
Automatic Prosodic Labeling with Conditional Random Fields and Rich Acoustic Features
Many acoustic approaches to prosodic labeling in English have employed only local classifiers, although text-based classification has employed some sequential models. In this paper we employ linear chain and factorial conditional random fields (CRFs) in conjunction with rich, contextually-based prosodic features, to exploit sequential dependencies and to facilitate integration with lexical feat...
A corpus-based analysis of transfer effects and connected speech processes in Vietnamese English
This paper presents a corpus-based descriptive analysis of the most prevalent transfer effects and connected speech processes observed in a comparison of 11 Vietnamese English speakers (6 females, 5 males) and 12 Australian English speakers (6 males, 6 females) over 24 grammatical paraphrase items. The phonetic processes are segmentally labelled in terms of IPA diacritic features using the EMU ...
Foreign accent identification based on prosodic parameters
In this paper we propose an automatic approach for foreign accent identification. Knowledge of the speaker’s origin makes it possible to adapt acoustic models for non-native speech recognition. In this study, we use a statistical approach based on prosodic parameters. This approach relies on the fact that prosody differs between languages, and has been developed within the framework of the HIWIRE (Human I...